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Abstract. Approximation theory is concerned with the ability to ap- 
proximate functions by simpler and more easily calculated functions. The 
first question we ask in approximation theory concerns the possibility of 
approximation. Is the given family of functions from which we plan to ap- 
proximate dense in the set of functions we wish to approximate? In this 
work we survey some of the main density results and density methods. 
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1 Introduction 

Approximation theory is that area of analysis which, at its core, is concerned 
with the ability to approximate functions by simpler and more easily calculated 
functions. It is an area which, like many other fields of analysis, has its primary 
roots in the mathematics of the 19th century. 

At the beginning of the 19th century functions were essentially viewed via 
concrete formulae, series, or as solutions of equations. However largely as a 
consequence of the claims of Fourier and the results of Dirichlet, the modern 
concept of a function distinguished by its requisite properties was introduced 
and accepted. Once a function, and more specifically a continuous function, 
is defined implicitly rather than explicitly, the birth of approximation theory 
becomes an inevitable and unavoidable development. 

It is in the theory of Fourier series that we find some of the first results of 
approximation theory. These include conditions on a function that ensure the 
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pointwisc or uniform convergence (of the partial sums) of its Fourier series, as 
well as the omnipresent L 2 -convergence. Similar results were also developed for 
other orthogonal series, and for power series (analytic functions) . However these 
results are of a rather particular form. They are concerned with conditions for 
when certain formulae hold. In the classical theory of Fourier series one does 
not ask if trigonometric polynomials can be used to approximate, or even if 
the information provided by the Fourier coefficients is sufficient to provide an 
approximation. Rather one wants to know if the partial sums of the Fourier 
series converge to the function in question. 

The first question we ask in approximation theory concerns the possibil- 
ity of approximation. Is the given family of functions from which we plan to 
approximate dense in the set of functions we wish to approximate? That is, 
can we approximate any function in our set, as well as we might wish, using 
arbitrary functions from our given family? In this work we survey some of the 
main density results and density methods. 

The first significant density results were those of Weierstrass who proved 
in 1885 (when he was 70 years old!) the density of algebraic polynomials in the 
class of continuous real- valued functions on a compact interval, and the density 
of trigonometric polynomials in the class of 27r-periodic continuous real-valued 
functions. These theorems were, in a sense, a counterbalance to Weierstrass' 
famous example of 1861 on the existence of a continuous nowhere differentiable 
function. The existence of such functions accentuated the need for analytic 
rigour in mathematics, for a further understanding of the nature of the set 
of continuous functions, and substantially influenced the further development 
of analysis. If this example represented for some a 'lamentable plague' (as 
Hcrmite wrote to Stieltjcs on May 20, 1893, see Baillard and Bourget [1905]), 
then the approximation theorems were a panacea. While on the one hand 
the set of continuous functions contains deficient functions, on the other hand 
every continuous function can be approximated arbitrarily well by the ultimate 
in smooth functions, the polynomials. 

The Weierstrass approximation theorems spawned numerous generaliza- 
tions which were applied to other families of functions. They also led to the 
development of two general methods for determining density. These are the 
Stone- Weierstrass theorem generalizing the Weierstrass theorem to subalgebras 
of C(X), X a compact space, and the Bohman-Korovkin theorem characterizing 
sequences of positive linear operators that approximate the identity operator, 
based on easily checked, simple, criteria. 

A different and more modern approach to density theorems is via "soft 
analysis" . This functional analytic approach actually dates back almost 100 
years. A linear subspace M of a normed linear space E is dense in E if and only 
if the only continuous linear functional that vanishes on M is the identically zero 
functional. For the space C[a, b] this result can already be found in the work of 
F. Riesz from 1910 and 1911 as one of the first applications of his "representation 
theorem" characterizing the set of all continuous linear functional on C[a, b]. 
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Density theorems can be found almost everywhere in analysis, and not 
only in analysis. (For a density result equivalent to the Riemann Hypothesis 
see Conrey [2003, p. 345].) In this article we survey some of the main results 
regarding density of linear subspaces in spaces of continuous real-valued func- 
tions endowed with the uniform norm. We only present a limited sampling of 
the many, many density results to be found in approximation theory and in 
other areas. A monograph many times the length of this work would not suffice 
to include all results. In addition, we do not prove all the results we quote. 
Writing a paper such as this involves compromises. We hope, nonetheless, that 
you the reader will find something here of interest. 

2 The Weierstrass Approximation Theorems 

We first fix some notation. We let C[a, b] denote the class of continuous real- 
valued functions on the closed interval [a, b], and C[a,b] the class of functions 
in C[a,b] satisfying f(a) = f(b). (C[a,b] may be regarded as the restriction 
to [a, b] of (b — a)-periodic functions in C(M).) We denote by U n the space of 
algebraic polynomials of degree at most n, i.e., 



and by T n the space of trigonometric polynomials of degree at most n, i.e., 



The paper stating and proving what we call the Weierstrass approximation 
theorems is Weierstrass [1885]. It seems that the importance of the paper was 
immediately appreciated, as the paper appeared in translation (in French) one 
year later in Weierstrass [1886]. Weierstrass was interested in complex function 
theory and in the ability to represent functions by power series and function 
series. He viewed the results obtained in this 1885 from that perspective. The 
title of the paper emphasizes this viewpoint. The paper is titled On the pos- 
sibility of giving an analytic representation to an arbitrary function of a real 
variable. We state the Weierstrass theorems, not as given in his paper, but as 
they are currently stated and understood. 

Weierstrass Theorem 2.1. For every finite a < b algebraic polynomials are 
dense in C[a,b}. That is, given an f in C[a,b] and an arbitrary e > there 
exists an algebraic polynomial p such that 



n„ = spanjl, x, . . . 



x n }, 



T n = span{l, sinx, cos a;, . . . , 



sin nai,cos nx). 



\f(x)-p(x)\ <e 



for all x in [a, 6]. 
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Weierstrass Theorem 2.2. Trigonometric polynomials are dense in C[0, 2w], 
That is, given an f in C[0, 2tt] and an arbitrary e > there exists a trigonometric 
polynomial t such that 

\f(x)-t(x)\<e 

for all x in [0, 2tt\. 

These are the first significant density theorems in analysis. They are gen- 
erally paired since in fact they are equivalent. That is, each of these theorems 
follows from the other. 

It is interesting to read this paper of Weierstrass, as his perception of these 
approximation theorems was most certainly different from ours. Weierstrass' 
view of analytic functions was of functions that could be represented by power 
series. The approximation theorem, for him, was an extension of this result to 
continuous functions. 

Explicitly, let (e„) be any sequence of positive values for which X^^Li e « < 
oo. Let p n be an algebraic polynomial (which exists by Theorem 2.1) satisfying 

\\f-Pn\\~ max \f(x) -p n (x)\ < e n , n=l,2, ... 

a<x<b 

Set q = pi and q n = p n+1 - p n , n = 1, 2, ... Then 

oo 

f( x ) = J2<ln(x). 

Thus every continuous function can be represented by a polynomial series that 
converges both absolutely and uniformly. Similarly, 'nice' functions in C[0, 2ir] 
enjoy the property that their Fourier series converges absolutely and uniformly. 
What Weierstrass proved was that every function in (7[0, 2ir] can be represented 
by a trigonometric polynomial series that converged both absolutely and uni- 
formly. 

The paper Weierstrass [1885] was reprinted in Weierstrass' Mathematis- 
che Werke (collected works) with some notable additions. While this reprint 
appeared in 1903, there is reason to assume that Weierstrass himself edited 
this paper. One of these additions was a short "introduction". We quote it 
(verbatim in meaning if not in fact). 

The main result of this paper, restricted to the one variable case, can be 
summarized as follows: 

Let f e C(M). Then there exists a sequence /i, f 2 , ■ ■ ■ of entire functions 
for which 

oo 

f{ X )=Y,m 
»=i 

for each xeft In addition the convergence of above sum is uniform on every 
finite interval. 
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Note that there is no mention of the fact that the fa may be assumed to 
be polynomials. 

Weierstrass' proof of Theorem 2.1 is rather straightforward. The same is 
not quite true of his proof of Theorem 2.2. He extends / from [a, b] so that it 
is continuous and bounded on all of M. He then smooths / by convolving it 
with the normalized heat (Gauss) kernel (l/k^/T^e - ^/^ . This "smoothed" 
is entire and is therefore uniformly approximable on the finite interval [a, b] by 
its truncated power series. Moreover the fk uniformly approximate / on [a, b] 
as k — > 0+. Together this implies the desired result. 

Over the next twenty-five or so years numerous alternative proofs were 
given to one or the other of these two Weierstrass results by a roster of some 
of the best analysts of the period. The proofs use diverse ideas and techniques. 
There are the proofs by Weierstrass, Picard, Fejer, Landau and de la Vallee 
Poussin that used singular integrals, proofs based on the idea of approximating 
one particular function by Runge (Phragmcn), Lebesgue, Mittag-LcfHcr, and 
Lerch, proofs based on Fourier series by Lerch, Volterra and Fejer, and the 
wonderful proof of Bernstein. Details concerning all these proofs can be found, 
for example, in Pinkus [2000] and Pinkus [2005]. We explain, without going 
into all the details, three of these proofs. 

One of the more elegant and cited proofs of Weierstrass' theorem is due 
to Lebesgue [1898]. This was Lebesgue's first published paper. He was, at the 
time of publication, a 23 year old student at the Ecolc Normalc Superieure. The 
idea of his proof is simple and useful. Lebesgue noted that each / in C[a, b] can 
be easily approximated by a continuous, piccewise linear curve (polygonal line). 
Each such polygonal line is a linear combination of translates of |x|. As algebraic 
polynomials (of any fixed degree) are translation invariant, it thus suffices to 
prove that one can uniformly approximate \x\ arbitrarily well by polynomials on 
any interval containing the origin. Lebesgue then does exactly that. Explicitly 



oo 

|x| = l-Y j a n {l-x 2 ) 

71=1 



where a\ = 1/2, and 

(2n- 3)! 



2 2n-2 n |( n _ !)!> n 2 ' 3 '--' 



This "power series" converges absolutely and uniformly to \x\ for all \x\ < 1. 
Truncating this series we obtain a series of polynomial approximants to \x\. 

When Fejer was 20 years old he published Fejer [1900] that formed the basis 
for his doctoral thesis. Fejer proved more than the Weierstrass approximation 
theorem (for trigonometric polynomials). He proved that for any / in C[0, 2ir] 
it is possible to uniformly approximate / based solely on the knowledge of 
its Fourier coefficients. He did not obtain this approximation by taking the 
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partial sums of the Fourier series. It is well-known that these do not necessarily 
converge. Rather, he obtained it by taking the Cesaro sums of the partial sums 
of the Fourier series. In other words, assume that we are given the Fourier series 
of/ 



/(a;)~ Cke * 



ikx 

k— — oo 

where 

1 r^n 

-ikx 



for every k G Zi. Define the nth partial sums of the Fourier series via 



s n (x) := ^2 c fe e 



kx 
k— — n 



and set 

\ s (x)-\ hs„(x) 

*„(/;*) = ^ • 

The cr„ are termed the nth Fejer operator. Note that tr n (/;-) belongs to T n 
for each n. What Fejer proved was that, for each / in C[0, 2n], a n (f; •) tends 
uniformly to / as n — > oo. This was also the first proof which used a specifically 
given sequence of linear operators. 

Simpler linear operators that approximate were introduced by Bernstein 
[1912/13]. These are the Bernstein polynomials. For / in C[0, 1] they are 
defined by 



m=0 



Bernstein proved, by probabilistic methods, that the B n (f;-) converge uni- 
formly to / as n — > oo. A proof of this convergence is to be found in Example 
4.2. 



3 The Functional Analytic Approach 

The Riesz representation theorem characterizing the space of continuous linear 
functionals on C[a, b] is contained in the 1909 paper of F. Riesz [1909]. The 
following year, in a rarely referenced paper, Riesz [1910] also announced the 
following (stated in more modern terminology). 

Theorem 3.1. Let Uk G C[a, b], k G K, where K is an index set. A necessary 
and sufficient condition for the existence of a continuous linear functional F on 
C[a, b] satisfying 

F(u k ) = c fe , keK 
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with \\F\\ <L is that 



^ a k c k < L ^ a k u k 

keK' keK' 



oo 



hold for every finite subset K' of K, and all real ak- 
in this same paper Riesz also states the parallel result for L p [a, b], 1 < p < 
oo. Questions concerning existence and uniqueness in moment problems were 
of major importance in the development of functional analysis. The full details 
of the 1910 announcement appear in Riesz [1911]. In these papers is also to be 
found the following result (again we switch to more modern terminology). 

Theorem 3.2. Let M be a linear subspace of C[a,b\. Then f e C[a,b] is in 
the closure of M, i.e., f can be uniformly approximated by elements of M, if 
and only if every continuous linear functional on C[a, b] that vanishes on M also 
vanishes on f. 

Riesz quotes E. Schmidt as the author of the very interesting problem whose 
solution is the above Theorem 3.2. As he writes, the question asked is: Being 
given a countable system of functions <p n e C[a,b], n = 1,2,..., how can one 
know if one can approximate arbitrarily and uniformly every f € C[a, b] by the 
<p n and their linear combinations? (Riesz [1911, p. 51]). Schmidt, in his thesis 
in Schmidt [1905], had given both a necessary and a sufficient condition for the 
above to hold. Both were orthogonality type conditions. However neither was 
the correct condition. The concept of a linear functional vanishing on a set of 
functions is very orthogonal- like. Lerch's theorem (Lerch [1892], see also the 
more accessible Lerch [1903]), states that if h € C[0, 1] and 



then h = 0. This theorem was well-known and frequently quoted. So it was not 
unreasonable to look for conditions of the form given in Theorem 3.2. 

Lerch's theorem is, in fact, a simple consequence of Weierstrass' theorem. 
If pk is a sequence of polynomials that uniformly approximate h, then 






However 




= 



for every k, and thus 
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which, since h is continuous, implies h = 0. 

As Riesz states, one consequence of the above Theorem 3.2 is that M is 
dense in C[a, b] if and only if no nontrivial continuous linear functional vanishes 
on M. The proof of Theorem 3.2, contained in Riesz [1911], is just an application 
of Theorem 3.1. 

Proof: We start with the simple direction. Assume / is in the closure of M. 
If F is a continuous linear functional that vanishes on M, then 

F(f) = F(f - g) 
for every g £ M. Given e > 0, there exists a, g* £ M for which 

||/-ffloo<e. 

Thus 



\F(f)\ = \F(f-g*)\<\\F 



<e\\F\\. 



As this is valid for every e > we have F(f) = 0. 

Now assume that / is not in the closure of M. Thus ||/ — g\\oo > d > 
for every g £ M. From this inequality and Theorem 3.1 there necessarily exists 
a continuous linear functional F on C[a, b] satisfying F(g) = 0, for all g € M, 
F(f) = 1, and ||F|j < L for any L > 1/d. This holds since we have 

\a\ < Ld\a\ < L\\af — g\\oc, 



for all g e M and all a. □ 

Shortly thereafter Helly [1912] applied these results to a question concern- 
ing the range of an integral operator. He proved the following two theorems. 

Theorem 3.3. Let K £ C([a, b] x [a, b]) and f £ C[a, b]. Then a necessary and 
sufficient condition for the existence of a measure of bounded total variation v 
satisfying 

f(x)= [ K(x,y)du(y), 

J a 

is the existence of a constant L for which 



^2a k f(x k ) 



k=i 



< L 



^a k K{x k ,y) 



k=l 



for all points xi,...,x n in [a, b], all real values ai, . . . , a„, all y £ [a, b], and all 
n. 
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Theorem 3.4. Let K £ C([a,b] x [a, b]). Then a necessary and sufficient 
condition for an f £ C[a, b] to be uniformly approximated by functions of the 
form 

b 

K{x,y)<j){y) Ay 

where the (p are piecewise continuous functions, is that for every measure /i of 
bounded total variation satisfying 

f K(x,y)d t i(x)=0 

J a 

we also have ^ 

f f(x)du.( y )=0. 

J a 



L 



In 1911 the concept of a normed linear space did not exist, and the Hahn- 
Banach theorem had yet to be discovered (although the Hclly [1912] paper 
contains results that come close). Banach's proof of the Hahn-Banach theorem 
appears in Banach [1929] (Hahn's appears in Hahn [1927]). Both the Hahn 
and Banach papers contain a general form of Theorems 3.1, namely the Hahn- 
Banach theorem. Both also essentially contain the statement that a linear 
subspace is dense in a normed linear space if and only if no nontrivial continuous 
linear functional vanishes on the subspace. Banach, in his book Banach [1932, 
p. 57], prefaces these next two theorems with the statement: We are now going 
to establish some theorems that play in the theory of normed spaces the analogous 
role to that which the Weierstrass theorem on the approximation of continuous 
functions by polynomials plays in the theory of functions of a real variable. 

Theorem 3.5. Let M be a linear subspace of a real normed linear space E. 
Assume f £ E and 

\\.f-g\\>d>o 

for all g £ M. Then there exists a continuous linear functional F on E such 
that F(g) = for all g £ M, F(f) = 1, and \\F\\ < l/d. 

The result of Theorem 3.5 replaces Theorem 3.1 in the proof of Theorem 
3.2 to give us the well-known 

Theorem 3.6. Let M be a linear subspace of a real normed linear space E. 
Then f £ E is in the closure of M if and only if every continuous linear func- 
tional on E that vanishes on M also vanishes on f. 

In none of these works of Hahn and Banach are the above-mentioned 1910 
or 1911 papers of Riesz mentioned. These Riesz papers seem to have been 
essentially forgotten. In fact the general method of proof of density based on 



10 



Allan Pinkus 



this approach is to be found in the literature only after the appearance of the 
book of Banach and the blooming of functional analysis. The name of Riesz is 
often mentioned in connection with this method, but only because of the Riesz 
representation theorem and similar duality results bearing his name. 

Today we also recognize the Hahn-Banach theorem as a separation theorem, 
and as such we also have the following two results. 

Theorem 3.7. Let E be a real normed linear space, <j) n elements of E, n e /, 
and f € E. Then f may be approximated by finite convex linear combinations 
of the 4> n , n € 1 , if and only if 

sup{F(0„) : n e 1} > F(f) 

for every continuous linear functional (form) F on E. 

Theorem 3.8. Let E be a real normed linear space, <j> n elements of E, n e /, 
and f € E. Then f may be approximated by finite positive linear combinations 
of the 4> n , n & I, if and only if for every continuous linear functional (form) F 
on E satisfying F(<f) n ) > for every n <G / we have F(f) > 0. 

Theorem 3.8 follows from Theorem 3.7 by considering the convex cone 
generated by the <j) n . 

There are numerous generalizations of these results. The book of Nachbin 
[1967] where these results may be found is one of the few to concentrate on 
density theorems. Much of the book is taken up with the Stone- Weierstrass 
theorem. However there are also other results such as the above Theorems 3.7 
and 3.8. 

4 Other Density Methods 

The Weierstrass theorems had a significant influence on the development of 
density results, even though the theorems themselves simply prove the density 
of algebraic and trigonometric polynomials in the appropriate spaces. Various 
proofs of the Weierstrass theorems, for example, provided insights that led to 
the development of two general methods for determining density. We discuss 
these methods in this section. 

The first of these methods is given by the Stone- Weierstrass theorem. This 
theorem was originally proven in Stone [1937]. Stone subsequently reworked 
his proof in Stone [1948]. It represents, as stated by Buck [1962, p. 4], one of 
the first and most striking examples of the success of the algebraic approach to 
analysis. There have since been numerous modifications and extensions. See, 
for example, Nachbin [1967], Prolla [1993] and references therein. 

We recall that an algebra is a linear space on which multiplication between 
elements has been suitably defined satisfying the usual commutative and asso- 
ciative type postulates. Algebraic and trigonometric polynomials in any finite 
number of variables are algebras. A set in C(X) separates points if for any 
distinct points x,y £ X there exists a g in the set for which g(x) ^ g(y)- 
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Stone- Weierstrass Theorem 4.1. Let X be a compact set and let C(X) 
denote the space of continuous real-valued functions defined on X. Assume A 
is a subalgebra of C(A). Then A is dense in C(X) in the uniform norm if and 
only if A separates points and for each x G X there exists an / G A satisfying 
f(x) ± 0. 

Proof: The necessity of the two conditions is obvious. We prove the sufficiency. 

First some preliminaries. From the Weierstrass theorem we have the ex- 
istence of a sequence of algebraic polynomials (p n ) (with constant term zero) 
that uniformly approximates the function \t\ on [— c, c], any c > 0. As such, if / 
is in A, the closure of A in the uniform norm, then so is p n {f ) for each n which 
implies that |/| is also in A. Furthermore 

max{/(x), ff (x)} = + 

and 

mm{f{x),g(x)} = V ' yy ' ^ y ' yy n . 

It thus follows that if /, g G A, then max{/, g} and min{/, g} are also in A. 
This of course extends to the maximum and minimum of any finite number of 
functions. 

Finally, let x, y be any distinct points in X, and a, (3 G M. We claim 
that there exists an h G A satisfying the interpolation conditions h(x) = a and 
h(y) — j3. By assumption there exists a g G A for which g{x) ^ g(y), and 
functions /i and ji in A such that fi(x) ^ while f2(y) 0. If g(x) — then 
we can construct the desired h as a linear combination of g and fi . Similarly, if 
g(y) — then we can construct the desired h as a linear combination of g and 
/2- Assuming g(x) and g(y) are both not zero, the desired h can be constructed, 
for example, as a linear combination of g and g 2 . 

We now present a proof of the theorem. Given / G C(X), e > and 
x G X, for every y G X let h y G A satisfy h y (x) = f(x) and h y (y) = f(y). 
Since / and h y are continuous there exists a neighborhood V y of y for which 
h y (w) > f(w) — e for all w G V y . The U y exV y cover A. As X is compact, it 
has a finite subcover, i.e., there are points yi, . . . , y n in A such that 

n 

U V vi = X - 

i=l 

Let g = max{h yi , . . . , h Vn }. Then jei and g(w) > f(w) — e for all w G A. 

The above 5 depends upon x, so we shall now denote it by g x . It satisfies 
gx(x) = f(x) and g x (w) > f(w) — e for all w G A. As / and g x arc continuous 
there exists a neighborhood J/x of x for which <7a;(u>) < f(w) + e for all w G 
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U x . Since U xe xU x covers X, it has a finite subcover. Thus there exist points 
xi , . . . , x m in X for which 

U u ** = x - 

t=l 

Let 

F = min{gr Xl , . . . ,5x m }. 

Then f el and 

f(w)-e < F(w) < f(w)+e 

for all w e X. Thus 

ll/-*1l<e- 

This implies that / el □ 

Example 4.1. As we mentioned prior to the statement of the Stone- Weierstrass 
theorem, algebraic polynomials in any finite number of variables form an alge- 
bra. They also separate points and contain the constant function. Thus alge- 
braic polynomials in m variables are dense in C(X) where X is any compact 
set in M m . This fact first appeared in print (at least for squares) in Picard 
[1891] which also contains an alternative proof of Weierstrass' theorems. The 
paper Weierstrass [1885] as "reprinted" in Weierstrass' Mathcmatische Werke 
in 1903 contains an additional 10 pages of material including a proof of this 
multivariable analogue of his theorem. 

Another method that can be used to prove density is based on what is 
called the Korovkin theorem or the Bohman-Korovkin theorem. A primitive 
form of this theorem was proved by Bohman in Bohman [1952]. His proof, 
and the main idea in his approach, was a generalization of Bernstein's proof 
of the Weierstrass theorem. Korovkin one year later in Korovkin [1953] proved 
the same theorem for integral type operators. Korovkin's original proof is in 
fact based on positive singular integrals and there are very obvious links to 
Lebesgue's work on singular operators that, in turn, was motivated by various 
of the proofs of the Weierstrass theorems. Korovkin was probably unaware 
of Bohman's result. Korovkin subsequently much extended his theory, major 
portions of which can be found in his book Korovkin [I960]. The theorem and 
proof as presented here is taken from Korovkin's book. 

A linear operator L is positive (monotone) if / > implies L(f) > 0. 

Bohman— Korovkin Theorem 4.2. Let (L n ) be a sequence of positive linear 
operators mapping C[a, b) into itself. Assume that 

lim L n {x l ) = x\ i = 0, 1, 2, 
and the convergence is uniform on [a, b]. Then 



lim (L n f)(x) = f(x) 
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uniformly on [a, b] for every f e C[a, b]. 

Proof: Let / € C[a, b]. As / is uniformly continuous, given e > there exists 
a S > such that if |xi — x 2 \ < S then \f(xi) — f(x 2 )\ < e. 
For each ye [a, 6], set 

Since 

|/(x)-/(y)|< £ 

for \x — y\ < S, and 

| /w _ /(9)|< «Kp£ 

for | a; — y| > 5, it is readily verified that 

Pt(x) < f(x) < p u (x) 

for all x e [a, b]. 

Since the L n are positive linear operators, this implies that 

(L nPi )(x) < (L n f)(x) < (L nPu )(x) (4.1) 

for all x e [a, b], and in particular for x = y. 

For the given fixed /, e and S the p u and pi are quadratic polynomials that 
depend upon y. Explicitly 

Since the coefficients are bounded independently of y e [a, b], and 
lim L n (x l ) = x\ i = 0, 1, 2, 

71 — >00 

uniformly on [a, b], it follows that there exists an N such that for all n > N, 
and every choice of y € [a, b] we have 

\(L n p u )(x) -p u (x)\ <e 

and 

\(L n pe)(x) -pe(x)\ <e 
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for all x £ [a,b]. That is, L n p u and L n pg converge uniformly in both x and y 
to p u and pi, respectively. Setting x = y we obtain 

(LnPu){y) < Pu(y) + £ = f(y) + 2e 

and 

(L n p t )(y) >p e (y)-e = f(y) - 2s. 

Thus given s > there exists an TV such that for all n > N and every 
y G [a, b] we have from (4.1) 

f(y)-2s<(L n f)(y)<f(y) + 2s. 

This proves the theorem. □ 

A similar result holds in the periodic case C[0,2tt], where "test functions" 
are 1, sinx, and cosx. Numerous generalizations may be found in the book of 
Altomare and Campiti [1994]. 

How can the Bohman-Korovkin theorem be applied to obtain density re- 
sults? It can, in theory, be applied easily. If the U n — spanjui, . . . , u n }, 
n = 1, 2, are a nested sequence of finite-dimensional subspaces of C[a, b], and 
L n is a positive linear operator mapping C[a, b] into U n that satisfies the condi- 
tions of the above theorem, then the (wfc)fcLi span a dense subset of C[a, b}. In 
practice it is all too rarely applied in this manner. The importance of the Ko- 
rovkin theory is primarily in that it presents conditions implying convergence, 
and also in that it provides calculable error bounds on the rate of approximation. 

Example 4.2. One immediate application of the Bohman-Korovkin theorem 
is a proof of the convergence of the Bernstein polynomials B n (f) to / for each 
/ in C[0, 1]. Recall from section 2 that for each such / 

n s \ 

^(/;-) = E/© -*)—■ 

We can consider the (B n ) as a sequence of positive linear operators mapping 
C[a, b] into Il„, the space of algebraic polynomials of degree at most n. It is 
readily verified that B n {\ ; x) = 1, B n (x ; x) = x and B n (x 2 ; x) = x 2 + x(l — 
x)/n for all n > 2. Thus by the Bohman-Korovkin theorem B n (f) converges 
uniformly to / on [0,1]. 

Example 4.3. Recall from section 2 that the Fejer operators a n maps C[0, 2ir] 
into T n . It is easily checked that a n is a positive linear operator. Furthermore, 
ct„(1; x) — 1, cr„(sinx; x) — [nj (n+l j) sinx, and cr„(cosx; x) = (n/ (n+1)) cosx. 
Thus from the periodic version of the Bohman-Korovkin theorem <r n (g) con- 
verges uniformly to g on [0, 2tt], for each g £ C[0, 2n}. 
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5 Some Univariate Density Results 

Example 5.1. Miintz's Theorem. Possibly the first generalization of conse- 
quence of the Weierstrass theorems, and certainly one of the best known, is the 
Miintz theorem or the Muntz-Szasz theorem. 

It was Bernstein who in a paper in the proceedings of the 1912 Interna- 
tional Congress of Mathematicians held at Cambridge, Bernstein [1913], and 
in his 1912 prize-winning essay, Bernstein [1912], asked for exact conditions on 
an increasing sequence of positive exponents A„ so that the sequence (x Xn ) is 
fundamental in the space C[0, 1]. Bernstein himself had obtained some partial 
results. In the paper in the ICM proceedings Bernstein wrote the following: It 
will be interesting to know if the condition that the series ^ 1/A„ diverges is not 
necessary and sufficient for the sequence of powers (x n ) to be fundamental; it 
is not certain, however, that a condition of this nature should necessarily exist. 

It was just two years later that Miintz [1914] was able to provide a solution 
confirming Bernstein's qualified guess. What Miintz proved is the following. 

Miintz's Theorem 5.1. The sequence 

t,A Ai A 2 



where < Ao < Ai < A 2 < • • • — > oo is fundamental in C[0, 1] if and only if 
A = and 

oc 

£-=oo. (5.1) 

fe=i fc 



There are numerous proofs and generalizations of the Miintz theorem. 
It is to be found in many of the classic texts on approximation theory, see 
e. g. Achieser [1956, p. 43-46], Cheney [1966, p. 193-198], Borwein, Erdelyi 
[1995, p. 171-205]. (The last reference contains many generalizations of Miintz's 
theorem and also surveys the literature on this topic.) We present here the 
classical proof due to Miintz, with some additions from Szasz [1916] that put 
Miintz's argument into a more elegant form. 

Proof: Let 

M n = span{x A °, . . . ,x Xn }, 

and 

£(/,M„)oc = min ||/-p||oo- 

pGM„ 

Based on the Weierstrass theorem it is both necessary and sufficient to prove 
that 

lim E(x m ,M n ) oo =0 

n^oo 



for each m = 0, 1, 2, . 
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To estimate E(x m , Mn)^ we first calculate 

E(f,M n ) 2 = mm ||/-p|| 2) 

where || • H2 is the L 2 [0, 1] norm. It is well known that 

E 2 (.f,M n ) 2 



2 lf „ ^ _ G(x x °,...,x^,f) 



G(x Xa , . . . ,x A ") 
where G(fi, . . . , / fe ) is the Gramian of /i, . . . , f k , i.e., 



As 



G(h,...,f k ) =det ((/,,/,)) J. =1 



(x p ,x q )=[ x p x g dx= 

Jo p + q+l 



and 

d / 1 y = rii<j<i<r( ffl »- a j)( fe i- b j) 

a simple calculation leads to 

£ 2 (a-,M„) 2 - nLo(--A,) 2 



(2m+l)nLo( m + ^ + 1 ) 2 ' 
Thus, as is easily proven, 

lim E(x m ,M n ) 2 = 

n^oo 

if and only if 

u A m-X k 
™^°° k=o m+ k + 



i.e., 



n/ 2m + 1 



fe=0 



Assuming ra^At for every fc (otherwise there was no reason to do this calcu- 
lation) we have 1 ^ (2m + l)/(m + X k + 1) > and 

2m +1 
hm = 0. 

k^oo m + Afe + 1 



Thus 

n(i 2m+i 

k=0 ^ 



m + Afe + 1 
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2m + 1 
2^ — — — = °' 



fe=0 



m + A fe + 1 



that in turn is equivalent to 



OO 

Ex 



fc=i 



A/, 



oo, 



independent of m. So a necessary and sufficient condition for density in the 
L 2 [0, 1] norm is that (5.1) holds. 

We now consider C[0, 1]. Assume 



oo 1 

Ex 



k=i 



Afe 



< oo. 



Then E(x m , M n ) 2 does not tend to zero as n — > oo for every m that is not one 
of the Afc. As 

E(f, M„) 2 < £(/, M„) oc 
for every / <G C[0, 1], we have that the system 



x A ° , x Al , x X2 , 



is not fundamental in C[0, 1]. Furthermore, if Ao > then all the functions 

X ^ X ^ X 

, . ■ ■ vanish at x = 0, and density cannot possibly hold. 
Let us now assume that (5.1) holds, and Ao = 0. We will show how to 
uniformly approximate each x m , m > 1. For x G [0, 1] 



^2a k x 



k=l 



< 



f 

JO 



£ ^mt™- 1 -^akXkt^- 1 ^ at 

n 

mt™- 1 -^dkXkt^- 1 



fe=i 



< 



fc=i 



2 \ V2 

dt 



Thus we can approximate x m arbitrarily well in the uniform norm from the 
system x Xl , x X2 , ... if we can approximate x" 1 ^ 1 arbitrarily well in the L 2 [0, 1] 
norm from the system x 1-1 , x 2-1 , .... We know that the latter holds if 



fri A fe - 1 



= oo 



k>ko 



where ko is such that Afe — 1 > 0. From (5.1) and since the Afe are an increas- 
ing sequence tending to oo, this condition necessarily holds. This proves the 
sufficiency □ 
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The above method of showing how the L 2 result implies the C[0, 1] result 
is due to Szasz, and simplifies a more complicated argument due to Miintz that 
uses Fejer's proof of the Weierstrass theorem. An alternative method of proof of 
Miintz's theorem and its numerous generalizations is via the functional analytic 
approach, and the possible sets of uniqueness for zeros of analytic functions, see 
e. g. Schwartz [1943], Rudin [1966, p. 304-307], Luxemburg, Korevaar [1971], 
Feinerman, Newman [1974, Chap. X], and Luxemburg [1976]. For some different 
approaches see, for example, Rogers [1981], Burckel, Saeki [1983], and the very 
elegant v. Golitschek [1983]. 

The above proof of Miintz and Szasz as well as most of the functional 
analytic proofs, that use analytic methods, first prove the L 2 result. Rudin's 
approach is more direct, and we reproduce it here. 

Rudin's Proof: Assume — A < Ai < • • •. If (x A ™) is not fundamental in 
C[0, 1] then from the Hahn-Banach theorem and Riesz representation theorem 
there exists a Borcl measure fx of bounded total variation such that 



L 



i 



x An dfj,(x) = 0, 

o 

n = 0, 1, 2, . . .. As A = and A„ > for all n > 1 we may assume the above 
holds for n = 1, 2, . . . and fj, has no mass concentrated at 0. Set 



f(z)= fx z Mx). 
Jo 

For x G (0, 1] and Rez > we have that x z — e zlnx and \x z \ — x Rcz < 1. It 
therefore follows that / is analytic and bounded in the right half plane, and of 
course satisfies 

/(A„) = 0, n=l,2,... 

Now set 

9(z) = f 

The transformation (1 + z)/(l — z) maps the unit disc to the right half plane. 
Thus g G H°°, the space of bounded analytic functions in the unit disc, and 
g(a n ) = where 

_ A n -1 

Now it is a known result associated with Blaschke products that the (a n ) are 
the zeros, in the unit disc, of a nontrivial g € H°° if and only if 



Ed- 



O:. 



< OO. 



71=1 
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It is readily checked that 5Z^ =1 (1 — \ a n\) < oo if and only if X^^Li 1M« < °°- 
Thus if Yl^Li = °°! then 5 = which implies that / = 0. But then 



1 

k 



= /(fc) = / x k d»(x) 
Jo 

for all k = 1, 2, . . . which implies by the Weierstrass theorem that = 0. Thus 
if X^^Li =00 then the (x A ")^ are fundamental in C[0, 1]. 

Assume that J^^Li < 00 • How can we construct the desired measure 
yu? One way is as follows. Set 

1 00 A — z 
= (z + 2) 2 n 2 + X n + Z - 

v 7 n— 

The function / is a meromorphic function with poles at —2 and — A„ — 2, and 
zeros at the A„. / is also bounded in Rez > —1 since each factor is less than 1 
in absolute value thereon. For each z satisfying Rez > — 1 we have by Cauchy's 
formula 

/( „ . _L / m*. 

2m J rR w - z 

where Tr is the right semi-circle of radius R(> l + \z\), centered at —1, together 
with the line from — 1 — iR to —1 + iR. Letting R — ► 00 , it may be readily shown 
that the integral over the semi-circle tends to zero, and we obtain 



As 



1 + z — is 
for Re z > — 1 we have 



2nJ_ 00 1 + z -is 

f x z ~ is dx 
Jo 



Set 



f(z) = J\ z (J- J°° f{-l + is)e- isl ™ds^j Ax. 

1 r°° 

d^(x) = — y f(-l + is)e- is lnx ds. 



This is the Fourier transform of /(— 1+is) at In a; and is bounded and continuous 
on (0, 1], since the factor l/(2 + z) 2 in the definition of / ensures that /(—1 + is) 
is a function in L 1 . Thus we have obtained our desired measure fi. □ 
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Example 5.2. Combining the functional analytic approach with analytic meth- 
ods has proven to be a very effective method of proving density results. As a 
general example, assume g is in C (M) and has an extension as an analytic func- 
tion on all of C Let A be a subset of 1R that contains a finite accumulation 
point, i.e., there are distinct A„ in A and a finite A* such that lim^oo A„ = A*. 
Set 

Ma = span{g(Ax) : A e A}. 
We wish to determine when Ma is dense in C[a,b]. The following result holds. 
Theorem 5.2. Let g, A and Ma be as above. Set 

N g = {n: 5 (n) (0)^0}. 



Then Ma is dense in C[a, b] if and only if: 
i) for [a,b] C (0, oo) or [a, b] C (— oo,0) 

^— ' n 

neN g \{0} 



ii) if a = or b = 0, then OgJV 9 and 



E 

neN g \{0} 



OO, 



Hi) if a < < b, then <G N g and 



^ n ^ n 

n even ti odd 



Proof: The conditions in (i), (ii) and (iii) are exactly those conditions that 
determine when 

span{x" : n G N g } 

is dense in C[a,b}. This is the content of the Miintz theorem in case (ii), and 
easily follows from the Miintz theorem in case (iii). In case (i) it follows from the 
Miintz theorem that the condition therein is sufficient for density. The necessity 
is also true, but needs an additional argument, see e.g., Schwartz [1943]. 

From the Hahn-Banach and Riesz representation theorems Ma is not dense 
in C[a, b] if and only if there exists a nontrivial measure \i of bounded total 
variation on [a, b] satisfying 




g(Xx) d/i(x) = 
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for all A G A. Assume such a measure exists. As g is entire, it follows that 

rb 



H z ) = / g(zx)dfj,(x) 

J a 



is entire. Furthermore h(X) = for all A G A. By assumption A contains a 
finite accumulation point. Thus by the uniqueness theorem for zeros of analytic 
functions h = 0. However h being identically zero does not necessarily imply 
that /i is the zero measure. It only proves that 



M\ = span{g(Aa;) : A G 1R}- 

For example, if g is a polynomial of degree m, then Ma is simply the space of 
polynomials of degree m. 
As 

rb 

g(zx) dfi(x) = 



/ 

J a 



and g is entire it may be shown, differentiating by z, that 

rb 

x n g (n) (zx)dfi(x) = 



/ 

J a 



for every nonnegative integer n. Setting z = gives us 

r b 



(n) (O) f x n dii{x) =0, n = 0,l, 

J a 



Thus 

ft 

x n dfi(x) = 0, 

for all n G iV fl . But span{x" : n G A^ g } is dense in C[a,b], so /x is the trivial 
measure. 

On the other hand, assume the conditions in (i), (ii) or (iii) do not hold. 
Thus span{x" : n G N g } is not dense in C[a,b], and there exists a nontrivial 
measure fi of bounded total variation satisfying 

rb 

x n dfi(x) = 



/ 

J a 



for all n ^ N g . Since g is entire 

«*) - v 9 '"' (0) 

and it follows that 



n£N„ 



X" 



L 



b 

g{\x) dfi(x) = 



for all A G M. Ma is not dense in C[a, b]. □ 
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For example, if g(x) = e x then N g = Z + so that (i), (ii) and (iii) always 
hold. Thus 

span{e A " x : A e A} 

is always dense in C[a, b] assuming A is a subset of M with a finite accumulation 
point. A change of variable argument implies that under this same condition 
on A the set 

span{x A " : A e A} 

is dense in C[a, (3} for every < a < [3 < oo. 

A question related to Miintz type problems is that of the fundamcntality 
of the functions (e XnX ), where (A„) is a sequence of complex numbers. This has 
been considered in the space of complex- valued functions in C[a, b], C(M + ), 
L p [a,b] and L P (M + ), 1 < p < oo. There has been a great deal of research 
done in this area, see, for example, Paley, Wiener [1934, Chap. VI], Levinson 
[1940, Chap. I and II], Schwartz [1943], Levin [1964, Appendix III], Levin [1996, 
Lecture 18], and the many references therein. 

Example 5.3. Akhiezer's Theorem. Let T be a subset of M\{— 1,1], and 
consider the set 

A/r = span <^ : 7 e T } . 

l*-7 J 

When is Afr dense in C[— 1,1]? One result is similar to Theorem 5.2. It may 
be found in Feinerman-Newman [1974, p. 116-117], but the proof therein is 
somewhat different. 

Proposition 5.3. IfT has either a finite accumulation point in 1R\[— 1,1] or 
00 is an accumulation point, then A/r is dense in C[— 1, 1]. 

Proof: Assume A/r 7^ C[— 1 , 1] . Then there exists a Borel measure u, of bounded 
total variation such that 




d/x(i) = 



for all 7 e T. Set 

f(z)= ( -i-dM*). 

J -1 t - z 

Note that f(-y) = for all 7 e T. It is readily verified that / is analytic on 
C\[— 1, 1], and analytic also at infinity. 

Thus if T has either a finite accumulation point in M\[— 1, 1] or 00 is an 
accumulation point, then / = 0. For |z| > 1 > \t\ 

/(*) = -£ - / t n Mt)- 
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As / = 0, this then implies that 

J t n dn(t) = 

for all n, which from the Hahn-Banach and Weierstrass theorems implies that 
fi is the trivial measure. This proves the proposition. □ 

What can be said if the only accumulation points of T are 1 or — 1 or both? 
The result is known, contains the previous Proposition 5.3 as a special case, 
and was proven by Akhiezer, see Achieser [1956, p. 254-256]. 

Akhiezer's Theorem 5.4. Let (~/ n )%Li be a sequence in M\[— 1, 1], and con- 
sider the set 

N = span < : n = 1, 2, . . . > . 

I * ~ In J 

Then N is dense in C[— 1, 1] if and only if 

oo 

^l-|7n-V^~l|=^ 

n=l 



See also Borwein, Erdelyi [1995, p. 208] where a different method of proof 
is used. They also give the above condition as 

oo 

- 1 = oo, 

n=l 

and these two conditions are in fact equivalent. Akhiezer's proof of this theorem 
is delicate and detailed, dependent on the construction of specific best approx- 
imants. We will not reproduce it here. Michael Sodin has a proof which uses 
complex variable theory. 

Example 5.4. The analysis literature is replete with results concerning the 
density of translates (and dilates) of a function in various spaces. These might 
be arbitrary, integer, or sequence translates (or dilates). Many of these results 
are generalizations, in a sense, of the Miintz and/or Paley- Wiener theorems. 
See, for example, both Example 5.2 and 5.3. 

There is a characterization of those / £ C (M) for which 

span{/(- — a) : a G M} 

is not dense in C(Ft) (in the topology of uniform convergence on compacta). 
Such functions are called mean periodic, see Schwartz [1947]. 

Some functions in C(M) have a further interesting property. 
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Proposition 5.5. Assume f = g (f is the Fourier transform of g) for some 
nontrivial g € L 1 {1R) with the support of g contained in an interval of length 
at most 2n. Then 

span{/(- — n) : n e Zj} 
is dense in C(M) (in the topology of uniform convergence on compacta). 

Proof: Assume the above set is not dense in C(M). There then exists a Borel 
measure \i of bounded total variation and compact support E such that 



Se 



f(x — n) dyu(x) = 



for all n e Z5. Assume / = g, as above, and supp{g} C [a, a+ 2n]. Thus for 
each n e % 

= / f(x — n) d/i(x) = / g{x — n) d/i(x) 

J E J E 



where fi is the Fourier transform of the measure /i. It is well known that fi, is 
an entire function. 

As all the Fourier coefficients of gfi on [a, a + 2n] vanish we have that gji 
is identically zero thereon. This implies that g must vanish where ^ 0. As /I 
is entire this implies that g = 0, a contradiction. □ 

The above is a simple example within a general theory. The interested 
reader should consult Atzmon, Olevskii [1996], Nikolski [1999], and references 
therein. Note that there is no function whose integer translates are dense in 
L 2 (R). 

Example 5.5. The Bernstein Approximation Problem. Assume w is a weight 
on Ft by which we will mean a non-negative, measurable, bounded function. 
For each / in C(M) satisfying 

lim u(x)f(x) = 

|a;|^oo 

set 

ll/ll^, = supw(x)\f(x)\, 

and let C U (IR) denote the real normed linear space of those / as above with 
WfWui < oo. The Bernstein approximation problem was first formulated in 
Bernstein [1924]. It asks for necessary and sufficient conditions on a weight w 
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such that (algebraic) polynomials are dense in C LJ {1R). That is, for each / in 
C w (iR) and e > there exists a polynomial p for which ||/ — < e. This 
immediately implies that u> must satisfy 

lim u)(x)p(x) = 

for every polynomial p. 

In Bernstein [1924] can be found the following result. Assume oj(x) = 
l/q(x), where 

oo 

q(x) = a nx 2n 

with a > 0, a n > for all n, and q not the constant function. Then a necessary 
and sufficient for polynomials to be dense in C U {M) is that 

In general a condition of this form is necessary, but not sufficient. It is often 
sufficient for "reasonable" weights. 

The literature on this problem is rather extensive including the review 
articles Ahiezer [1956] and Mergelyan [1956], see also Lorentz, v. Golitschek 
and Makovoz [1996, p. 28-33], and Timan [1963, p. 16-19]. The article of 
Mergelyan, as well as Prolla [1977], includes a proof of this next result. Let M.^ 
denote the set of polynomials p satisfying u{x)\p{x)\ < 1 + |x| for all x e M, 
and set 

M u (z) =sup{|p(z)| : peM^}. 

Mergelyan's Theorem 5.6. Let u be as above. Then a necessary and suffi- 
cient for polynomials to be dense in (M) is that 

M u (z) = oo 

for every z e C\M. 

Unfortunately this condition is not easy to check. 

Here is a condition that is easier to check, but which only holds for certain 
weights. Assume u — exp{— Q}, u> is even, and Q is a convex function of In a; 
on (0, oo). Then 

f^ — 

is both necessary and sufficient for polynomials to be dense in C U {R), see 
Mhaskar [1996, p. 331]. 
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Example 5.6. Markov Systems. Assume we are given a sequence of functions 
(u m )m=o m C[ a ' What we have been asking is when this sequence is funda- 
mental, i.e., linear combinations are dense. That is when, for each / in C[a, b] 
and e > 0, there exists a finite linear combination u of the (u TO )^ =0 such that 

11/ - w||oo < e. 

The only general result characterizing the density of such sequences is the some- 
what tautological Theorem 3.6. However, when the sequence (u m )m = o has a 
particular type of Chebyshev property, then P. Borwein proved a surprisingly 
interesting condition equivalent to density. 

To explain his result we first need some definitions. Given the sequence 
(«m)m=o in C[a, b] we set 

U n = span{u ,ui, . . . ,u n } 

for each n £ Z+. We say that U n is a Chebyshev space if no u E U n , u ^ 0, 
has more than n distinct zeros in [a, b]. We say that the sequence (u m )^ =0 is a 
Markov sequence if U n is a Chebyshev space for n = 0, 1 . . .. There are numerous 
examples of Markov sequences. For example, (x Xm )^ =0 1S a Markov sequence 
on [a, b] where a > and the (A m )^_ are arbitrary distinct real values, while 
(l/(x — c m ))^ =0 is a Markov sequence on any [a, b] where the (c m )^ =0 are 
distinct values in M\[a,b]. 

In what follows we assume that the (u m )^ =0 is a Markov sequence. Let 
t n G U n be of the form t n = u n — v n with v n G U n -\, satisfying 

||in||oo = min ||u„ - u||oo- 

It is well known, from the Chebyshev and Markov properties, that t n is uniquely 
defined and has n zeros in (a, b). Let x\ < ■ ■ ■ < x n denote these n zeros and 
set xq = a, x n+ i = b. The mesh of t n is defined by 

m„ = max (xi — x;-i). 

i=l,...,n+l 

It is readily proven that for any k < n the function tk has at most one zero 
between any two consecutive zeros of t n . From this it follows that 

lim m n = 

n— >oc 

if and only if 

liminf m n = 0. 

n — >oo 

The following result may be found in Borwein [1990], and also in Borwein, 
Erdelyi [1995, p. 155-158]. 

Borwein's Theorem 5.7. Assume (u m )^ =0 is a Markov sequence in C 1 [a, b] 
and u = 1. Then the sequence (w m )^ =0 JS dense in C[a, b] if and only if 

lim m„ = 0. 

n^oc 
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A similar result relating density to Bernstein-type inequalities is in Borwein, 
Erdelyi [1995b], and Borwein, Erdelyi [1995, p. 206-211]. 

Example 5.7. The following result is a special case of a general theorem of 
Schwartz [1944] (see also Pinkus [1996] and references therein). Here we again 
consider C (M) , with the topology of uniform convergence on compacta. We are 
interested in determining the set of functions in C(M) that are both translation 
and dilation invariant. 

Proposition 5.8. If a £ C(M), a ^ 0, then 

C(M) = span{cr(a ■ +0) : a, £ M} 

if and only if o is not a polynomial. 
Proof: Let 



If Ma 7^ C(M) then there exists a nontrivial Borel measure \x of bounded total 
variation and compact support such that 



for all a,0 £ M. Since \i is nontrivial and polynomials are dense in C(M) in the 
topology of uniform convergence on compact subsets, there must exist a k > 
such that 



It is relatively simple to show that for each (j> £ Cq°(M), (infinitely differen- 
tiable and having compact support) the convolution (cr* <f) is contained in A4 a . 
Since both a and <f> are in C(JR), and 4> has compact support, this can be proven 
by taking limits of Riemann sums of the convolution integral. We also consider 
taking derivatives as a limiting operation in taking divided differences. Since 
(a * <fi) £ C°°(1R), and thus it and all its derivatives are uniformly continuous 
on every compact set, it follows that for each a,0 £ M 



M a = span{cr(a • +0) : a, £ M}. 





d 



(a * <p)(ax + 0) 



(o*<j>) {n \ax + 0) £ M g . 



= x 



Thus 




for all a,0 £ M and n £ Z+. Setting a = 0, we see that 
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for each choice of f3 € 1R, n € Z + and <p e Cq°(M). This implies, since 
J K £ fe d/z(x) 7^ 0, that 

(<7*<£)( fc )=0 

for all (/> € C£°(M). That is, cr( fe ) = in the weak sense. However, as is well- 
known, this implies that — in the strong (usual) sense. That is, a is a 
polynomial of degree at most k — 1. 

The converse direction is simple. If a is a polynomial of degree m, then 
M a is exactly the space of polynomials of degree m, and is therefore not dense 
in C(M). □ 

Example 5.8. Splines are piecewise polynomials with a high order of continu- 
ity. For a = £0 < £1 < • • • < £fc < £fc+i = 6 we set 

S„(a, ...,&) = {*€ Cf"- 1 ) [a, 6] : s e U n \ {u _ u6i) ,i = 1, . . . , fc + 1}. 

Hence a function s belongs to <S„(£i, . . . if it is a C^™ -1 ) function, i.e., has 
a certain global level of smoothness, and is a polynomial of degree at most n on 
each of the intervals £»). We say that <S„(£i, . . . , £fc) is the space of splines 

of degree n with the simple knots (£1, • • • , £&)• When using splines one fixes the 
degree and permits the number (and placement) of the knots to vary. From the 
perspective of numerical computations, approximation by splines enjoys many 
advantages over approximation by algebraic and trigonometric polynomials. As 
the number of knots increases the corresponding space of splines may or may 
not "become dense" in C[a, b]. Whether it docs or not simply depends upon if 
the knots become dense in [a, b]. 

To be more exact, for each k = 1, 2, ... let 

Sk = Sn(£l , ■ ■ ■ 1 £fc) 

for some set of k knots as above, where £q = a and £^ +1 = b. For each such k, 
let 

denote the maximum mesh length. Then we have, see for example, de Boor 
[1968], 

Proposition 5.9. For each f e C[a, b] there exist Sk € Sk such that 

lim ||/ - SfeHoo = 

k^oo 

if and only if lim^oo rrik = 0. 

Proof: We first assume that lim^oo mk = 0. The set 



(1, x, . . . , x n , (x £1 )™ , ■ • • , (x — ), 
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where equals x n for x > and for x < 0, is a basis for Sfc. However there 
are also 'better' bases. They are given by B-splines 

. . . , B n+k+1 ). 

Each is nonnegative, Y,"=i +1 B i = 1 on [a, b], and supp £?f = [^-n-i,^] 
where £ n < ■ ■ ■ < < a, and 6 < $ +2 < • • • < # +fc+1 . 
For each <5 > 0, let 

= max |/(a;) - f(y)\ 

\x—y\<d 

denote the usual modulus of continuity of /. As / is continuous on [a, b] it is 
also uniformly continuous thereon and thus 

lim u}(f;8) = 0. 
Now choose t\ <G supp B \ n [a, 6], i = 1, . . . , n + k + 1, and set 

n+fe+l 

E /(*?)**(*)■ 

i—l 

(sfe is called a quasi-interpolant.) Then for a; e , j € {0, 1, • • • , fc} we 

have 

n+fe+l 

l/^-^x)! = !/(«)- E f(ti) B H*)\ 

i=l 

< E !/(*)- 

i=l 
n+j+2 

= E \m-fti)\BH*) 

since Bf(ar) = for i $ {j, . . . , n + j + 2}. Thus 

\f(x) - s k (x)\ < max \f(x)-f(U)\<u)(f;(n + 2)m k ), 

i=j,...,n+j+2 

and 

||/-*fc|| <w(/; (n + 2)m fc ), 

from which we obtain 

lim ||/- s k \\ = 0. 

If the TOfc do not tend to zero, then there is a subinterval [c, d] of [a, b] of 
positive length, and a subsequence (fc m ) of (k) such that Sfc m has no knots in 
[c, d]. That is, each function in S km is a polynomial of degree at most n on [c, d]. 
If / G C[a, b] is not a polynomial of degree at most n on [c, d], then there exists 
a C > such that 

ll/-«ll>c 

for all s e Sfe m . □ 
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Each >S„ (£1 , . . . , £fc ) is a linear space of splines of degree n with k fixed 
knots. One can also consider the nonlinear set S n .k of splines of degree n with 
k free knots, i.e., 

n k 

S n ,k = {s(x) = °i xl + di ( x ■■ Ci,di e R, a <£!<■■■< £ k < b}. 

2=0 4=1 

(Note that the set S n .k is not closed.) 

The following is a consequence of our previous density result. 



Proposition 5.10. Let p e (l,oo). Assume / € L p [a,b]\S n .k and s* e S U: k 
satisfies 

\\f- S *\\ P <\\.f-s\\ p 



for all s € <S n ,fc- Then s* £ S n ^-i- 

Proof: Let F be a continuous linear functional on L p [a,b] that satisfies 

= 1 

and 

F(f-s*) = \\f-s*\\. 

As / — s* ^ and p e (1, oo), such an F exists and is unique. 

Our proof is by contradiction. If s* e S n ^-i, then for each d € M and 
£ € [a, 6] we have 

ll/-*l P <ll/-«*-d(*-0+lli>- 
Thus from Theorem 3.5 we have 

F((x-0l) = 

for each £ <G [a, 6]. In addition, as 

\\f-s*\\ p <\\f-s*-p\\ p 

for every p e LT„ , we have 

F(a; fe )=0, fc = 0,...,n. 

From Proposition 5.9 

span{l, : £e [M]} 

is dense in C[a, 6] and thus in L p [a,b]. From Theorem 3.6 this implies that 
F = 0. A contradiction. □ 



Density in Approximation Theory 



31 



The exact same argument proves, for example, that if 

k 

V k = a 3 XmS '■ a 3 G ^ m 3 /Z -I 1 

3=1 

then in L p [a, b], 1 < p < oo, any best approximation to / from Vk is never 
contained in Vk-i- 

The key ingredients of the above argument are the density of the set under 
consideration (in the above examples, splines and algebraic polynomials) in the 
normed linear space E, and the fact that E is smooth. That is, to each nonzero 
element of the space E there is a unique continuous linear functional of norm 
one that attains its norm on the given element. 

Example 5.9. Here are two examples where we consider the density of positive 
cones. That is, we present some applications of Theorem 3.8. 

Let II denote the space of all algebraic polynomials and n + the positive 
cone of all algebraic polynomials with nonnegative coefficients. We first prove 
the following result due to Bonsall [1958]. 

Theorem 5.11. The uniform closure of U + on [—1,0] is exactly the set of f 
in C[-l, 0] for which /(0) > 0. 

Proof: Let g n (x) = (1 + x) n and 4> n {x) = x n for all n € Z + . Note that g n and 
<j) n arc in n + , and 




Assume F is a continuous linear functional on C[— 1,0] satisfying F{(f> n ) > for 
every n € Then 

%)>(j)f(W- 
Now \\g n \\ = \\<t>n\\ = 1 for all n - Thus 

\\F\\>$F m , 

for each n > k. Fix k > 1 and let n — > oo. This implies that F(<f>k) — for all 
k = 1,2,.... Thus each icpk, k > 1, is in the uniform closure of 11+ on [—1,0]. 
As every / G C[— 1, 0] satisfying /(0) = is in the uniform closure of the space 
generated by the ±<frk, k > 1, the result now easily follows. 

Bonsall actually proves that each F as above is necessarily of the exact 
form F(f) = c/(0) where c = F(l) > 0. This he proves as follows. For each 
/ e C[-l, 0] and e > 0, let p e II satisfy 



Il/-P||<e 
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Thus we have |/(0) - p(0)| < e. Since F(<f> k ) = 0, k > 1, we obtain 



- F(1)P(0)| = - P(0))| = - p)| < 



Furthermore 



|F(/) - F(1)/(0)| = - F(l)p(0) + F(l)p(0) - F(1)/(0)| < 2e||F||. 



There is an alternative method of proving this result via a slight gener- 
alization of the Stone- Weierstrass theorem. Consider the set of / in C[— 1,0] 
satisfying /(0) = 0. Now e ax — 1 is in the uniform closure of II + on [—1, 0] for 
a > 0. (Truncate the power series expansion about 0.) Furthermore if / and g 
arc in this closure then so is fg. As e ax — 1 approaches —1 uniformly on [—1, 5] 
for any S < as a — > oo, and is bounded on [5,0], it follows that for / in the 
uniform closure of II + on [—1, 0] and satisfying /(0) =0we also have — / in this 
same closure. In addition p(x) = x is nonzero for all x ^ and separates points. 
Thus from an elementary generalization of the Stone- Weierstrass theorem the 
uniform closure of II + on [—1,0] contains the set of all / in C[— 1,0] satisfying 
/(0) = 0. The result now follows. 

Thus for any a < b < the uniform closure of II + on [a, b] is exactly all of 
C[a,b]. What happens if [a, b] C [0, oo)? It is well known that in this case the 
uniform closure of II + is simply the set of analytic functions in [a, b] given by a 
power series about with nonnegative coefficients which converges in [a, b]. 

There are also somewhat surprising results due to Nussbaum, Walsh [1998], 
generalizing work of Toland [1996]. These results are used to investigate when 
the spectral radius of a positive, bounded linear operator belong to its spectrum. 
A special case of what they prove is the following: 

Theorem 5.12. For any a < — 1 the uniform closure of H + on [a, 1] contains 
the set of all f in C[a, 1] that vanishes identically on [— 1, 1]. 

Proof: We present two proofs of this result. The first proof uses the Hahn- 
Banach theorem and is that found in Nussbaum, Walsh [1998]. The second 
proof is constructive. 

Assume we are given any continuous linear functional F on C[a, 1] satisfying 
F(x n ) > for all n £ + . From the Riesz representation theorem, this implies 
the existence of a Borel measure fi of bounded total variation satisfying 



As this is valid for each e > we obtain 



F(/) = F(l)/(0). □ 
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for all n € Z + . We will prove that supp{^} C [—1,1]. As this is true then 

f(x) dn(x) = 



/ 

J a 



for every / in C[a, 1] that vanishes identically on [—1, 1], proving our theorem. 
To this end, consider 



G is analytic in C\[a, 1], and vanishes at oo. For \z\ > A = sup{|ai| : x £ 
supp{/i}} we have 

oo 



z- 

n=l 



where 

c„ = / a;™ d y u(a;) > 0. 

J a 

Note that H(z) = G(l/z) is analytic in C\{(— oo, 1/a] (J[l, oo)} and has about 
the origin a power series expansion with nonnegative coefficients. From a the- 
orem of Pringsheim, if the radius of convergence of the power series is p > 
then the point z = p is a singular point of the analytic function represented by 
the power series. As the power series converges on [0, 1) the radius of conver- 
gence is at least 1, and therefore H is analytic in C\{(— oo, — 1] (J[l, oo)} and 
G analytic inC\[— 1, 1]. That is, G is in fact analytic in [a, —1). This implies, 
see Nussbaum, Walsh [1998, p. 2371], that the measure d^ has no support in 
[o,-l). 

The following constructive proof of this result is based on a variation of a 
proof to be found in Orlicz [1992, p. 99]. For n G ffl, odd, consider the function 



9n(x)= f e 4 "/"-ldt. 
Jo 



Note that the integrand is uniformly bounded on [a, 1] and 

lim e*" /n - 1 = 

As 



e 

n^oo 



0, — 1 < i < 1 
-1, a<t<-l. 



this function is in the uniform closure of Thus, so is g n . Set 

r( , _ f -(x+ 1), a<x<-l 
lJ 10, -1<z<1. 
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Then 

lim (G(x)-g n (x)) = 

n— >oo 

uniformly on [a, 1]. That is, for all x G [—1, 1] 



e*"/" - 1 dt 



< e 1 /™ - 1, 



while for x G [a, —1] 



-(x + l)-J* (e*" /n -l) . 
-(x + 1) - (e*"/" - l) d* + y" (e*"/" - l) 



dt 



< 



J e tn/n dt + J (l - e* n/ ") dt < J e tn/n dt+{l-e- 1/n ). 



Thus G is in the uniform closure of II + on [a, 1]. 

Moreover, as seen above, the function e*"/™ — 1 is uniformly bounded and 
approaches 

0, -l<z<i 
— 1, a < x < —1. 



H(x) 



The convergence to H is uniform in [a, 1], away from any neighbourhood of — 1. 
Thus GH = — G is also in the uniform closure of 11+ on [a, 1], and therefore 
the uniform closure of II + on [a, 1] contains the algebra generated by G. An 
elementary generalization of the Stone- Weierstrass theorem implies that the 
uniform closure of 11+ on [a, 1] contains the set of all / in C[a, 1] which vanish 
identically on [—1,1]. □ 

The above result is an extension of Theorem II' in Orlicz [1999, p. 96]. 
Orlicz proved that for every / G C[a, — 1] satisfying /(— 1) = and for each 
e > 0, there exists a p G II of the form 

n 

P( x ) =^2a k x k 



simultaneously satisfying 
and 



ll/-p||[o,-i] <£ 

n 

^2\a k \ <e. 

k=0 
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6 Some Multivariate Density Results 

In the section we consider applications of the results of the previous sections to 
multivariate functions. 

Example 6.1. We start with an application of the Stone- Weierstrass theorem. 
Let hi, . . . , h m be any m fixed real- valued continuous functions defined on X, 
a compact set. Let 

m 

M = sp&n{g(^2a t h t ) : a = (ai, . . . ,a m ) e M m , g E C(M)}. 

i=l 

When is M dense in C(X)? 

Proposition 6.1. M = C(X) if and only if for each x, y e X, x ^ y, there 
exists i G {1, . . . , m} such that hi(x) ^ hi(y). 

Proof: If there exists an x ^ y for which hi(x) = hi(y), i = 1, ...,m, then for 
every / e M we have f(x) = f(y) and obviously M. ^ C{X). On the other 
hand, assume that for each x, y e X, x ^ y, there exists i e {1, . . . , m} such 
that hi(x) 7^ hi(y). Consider the linear span of the set 

m 

(^2 a^i) 

i=l 

as we vary over all a € M m and k = 0, 1, 2, This is an algebra generated by 

where the ii are non-negative integers. Furthermore this algebra contains the 
constant function and separates points. Thus the density follows from the 
Stone- Weierstrass theorem. □ 

Example 6.2. Is it true that for arbitrary compact sets X and Y we always 
have that span{C(X)xC(F)} is dense in C(XxY)7 Ii X and Y are compact 
subsets of M this follows from the fact that algebraic polynomials are dense in 
C(XxY), and each algebraic polynomial is a linear combination of products of 
monomials in X and monomials in Y. Similarly to Example 6.1 we have: 

Dieudonne Theorem 6.2. If X and Y are compact, then the linear space 
span{C(X)xC(Y)} is dense in C(XxY). 

Proof: For / e C(X) and g E C(Y) the function (fxg)(x,y) = f(x)g(y) is 
in C(XxY). Furthermore all finite sums of the form fixgi + • • • + j m xg m 
clearly form a subalgebra of C(XxY) that contains the constant function and 
separates points. Thus by the Stone- Weierstrass theorem span{C(X)xC(F)} 
is dense in C(XxY). □ 
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This theorem, originally proven in Dieudonnc [1937] by other methods, 
easily extends to a product of any finite number of compact spaces. It may also 
be found in Nachbin [1967] and Prolla [1977]. 

Example 6.3. Here is a simple application of the functional analytic approach 
to density. We consider C(lR n ) with the topology of uniform convergence on 
compacta. As we recall, the set of functions for which the span of all their 
translates are not dense in C(M n ) are called mean-periodic functions. There 
is no known characterization of mean-periodic functions in C(M n ) for n > 2. 
However not many functions can be mean-periodic. For example 



Proof: The continuous linear functional on C(M n ) are represented by Borcl 
measures of bounded total variation and compact support. If the above space 
is not dense in C(M n ), then there exists such a nontrivial measure /x satisfying 



for all a £ M n . Both g and fj, have "nice" Fourier transforms. Since the above 
is a convolution we must have 

<?(w)/i(w) = 0. 

Now fl is an entire function, while g is continuous. Since g must vanish where 
/j 7^ 0, it follows that g = and thus g — 0, a contradiction. □ 

Example 6.4. Let (• , •) denote the usual inner (scalar) product on M n . Ap- 
plying Propositions 6.1 and 5.8 we prove the following result. 

Proposition 6.4. For each a £ C(M) 

span{cr((a ,•)+&) : a £ M n , b £ M} 

is dense in C(M n ) (uniform convergence on compacta) if and only if a is not a 
polynomial. 

Proof: If a is a polynomial of degree m, then each a ((a , •) + b) is contained in 
the space of polynomials of total degree at most m on lR n , and thus the above 
span is certainly not dense in C(M n ). 

Assume a is not a polynomial. Choose an / in C(M n ), X any compact 
subset of M n , and e > 0. From an application of Proposition 6.1 we have the 
existence of gt £ C(M) and a k £ M" , k = 1, . . . , m, such that 



Proposition 6.3. If g £ C{R n ) n L 1 



(M n ), g ^ 0, then 



C(M n ) = span{. 9 (- - a) : a e 1"}. 




in 



/(x)-^ fffc ((a fc ,x)) <e 
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for all x £ X. Let [c,d] be a finite interval of Ft containing all values (a fc , x) 



for x £ X and fc = 1, 



, to, i.e., 



|J{(a fe ,x) : x€X}C M. 



fc=i 



From Proposition 5.8 we have the existence of Cik,ctik, Pik € M, i = 1, . 
fc = 1, . . . , to, for which 



< 



i=l 

for all i G [c, d] and fc = 1, . . . , to. Thus for all x e X 

m n k 

/(x) - ^2 X! Ciko-iaik (a fe , x) + /3 ife ) 



TO 



fe=l 



< 2e, 



which proves the density. □ 

Proposition 6.4 is a basic result in one of the models of neural network 
theory, see Leshno, Lin, Pinkus, Schocken [1993] and Pinkus [1999]. 

Example 6.5. Ridge Functions. Ridge functions were considered in the previ- 
ous example. They are functions of the form <?((a, •)) for some fixed 'direction' 
a and some function g G C(M). They are functions constant on the hyperplanes 
{(a , x) = t} for every t E M. 

Let f2 be a subset of M n . In what follows we assume that f2 is a subset 
of S'™~ 1 , i.e., all elements of are of norm 1. (This is simply a convenient 
normalization.) The question we ask is: What are necessary and sufficient 
conditions on fl such that the set of all ridge functions with directions from 
are dense in C(M n ). The result we prove is due to Vostrecov, Kreines [1961], 
see also Lin, Pinkus [1993]. We will apply both the Weierstrass theorem and 
the Riesz representation theorem in obtaining these conditions. 

Let 

M(Q) = span{#((a, •)) :ae!l, g e C(M)}. 
Note that we vary over all a £ Q and all g £ C (Ft) . 

Theorem 6.5. The linear space A4(Cl) is dense in C(M n ) in the topology of 
uniform convergence on compacta if and only if the only homogeneous polyno- 
mial (of n variables) that vanishes identically on ft is the zero polynomial. 

Proof: (=>)■ Assume there exists a nontrivial homogeneous polynomial p of 
degree fc that vanishes on SI. Let 



p(y) = E 5 -y m 



|m| = fe 
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where m = (mi, . . . , m n ) G /P. " . |m| = mi + • • • + m n and y m = y™ 1 • • • j/™". 
Choose any <j> e C§°(lR n ), 0^0. For each m e |m| = fc, set 



<9 A 



fe™ 1 • • • dxT 



and define 



V(x) - ^ 6 m i? m 0(x). 

|m|=fc 



Note that ^ G C§°(M n ), ip ^ 0, (supp-0 C supp</>), and 

^ = « fc </>p 

where ~ denotes the Fourier transform. As p is homogeneous, p(Xa) = V>(Aa) = 
for all a G O. and X <E M. 
We claim that 



/ 5 ((a,x))^(x)dx = 

JlR" 



for all a G f2 and g € C(JR), i.e., the nontrivial linear functional defined by 
integrating against ip annihilates A4(Cl). From the Riesz representation theorem 
this implies that M(fl) is not dense in C(M n ). 
We prove this as follows. For ae O we write 

= £(Aa) = j V(x)e- A < a > x > dx 



(2tt)"/ 2 J_oo L7 (a ^ x)=t 



Since this holds for all A <E M, we have that 



'<a,x)=t 

for all t. Thus for any g G C(iR) 



/ V(x) dx = 

J(a,x)=t 



I <?«a,x))V( 



x) dx 



(a,x)=t 



V>(x)dx p(t) dt = 0. 



(«*=). Assume that for a given k G iV no nontrivial homogeneous polynomial 
p of degree k vanishes identically on f2. We will prove that A4(Cl) includes all 
homogeneous polynomials of degree k (and thus all polynomials of degree at 
most k). If the above holds for all k G Z, + it then follows that ftA{Q) contains 
all polynomials and therefore A4(fl) = C(M n ). 
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Let a e fl and set g((a,x)) = ((a,x)) fe , whence ((a,x)) fe e Since 
£)mi x m 2 _ (5 mi m2 /c!, for mi,m 2 € <#".' . |nii| = |m 2 | = fc, it easily follows that 
every linear functional I on the finite dimensional linear space of homogeneous 
polynomials H£ of degree k may be represented by some q £ HJ} via 

t(p) = q(D)p 

for each p <G i? ^ . 

For any given q <E H£, 

9(£>)((a,x)) fe = fc!g(a). 

If the linear functional £ annihilates ((a , x)) fc for all a S fi, then its representor 
q £ H% vanishes on ft. By assumption this implies that q = 0. The fact that 
no nontrivial linear functional on H% annihilates ((a.,x)) k for all a G Q implies 

HI =span{((a,x)) fc :aeSl}. 

Thus HI CM(fl). □ 

Example 6.6. There are other results of the same general flavor as that found 
in Example 6.4. For example, assume || • || is the usual Euclidean norm on M n . 
Then we have from Pinkus [1996] the following two results. 

Proposition 6.6. For each a e C(1R + ) 

sp&n{a(p\\ • -a||) : p>0,ae FT} 

is dense in C(M n ) (uniform convergence on compacta) if and only if a is not an 
even polynomial. 

Proposition 6.7. For each a e C(1R) 

span{cr Y[( - h^j : a,h, . . . ,b n € R} 

is dense in C(M n ) (uniform convergence on compacta) if and only if o is not of 
the form 

r r n—1 

<7(i) = ^c ^+^^ C ^(ln|i|r 

j=0 j = \ 1=1 

for some finite r and coefficients (%■). 
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Example 6.7. A very interesting result, with applications in Radon transform 
theory, is that conjectured by Lin and Pinkus and proved by Agranovsky, Quinto 
[1996]. It characterizes the set of centers of radial functions needed for density. 
The complete answer is only known in M 2 . 

Theorem 6.8. Let A C E 2 . Then 

span{. 9 (|| --a||) : g e C(M), a e A} 

is not dense in C(M ) (uniform convergence on compacta) if and only if A is 
composed of a finite number of points together with a subset of a set of straight 
lines having a common intersection point and where the angles between each of 
the lines is a rational multiple of tt (a Coxeter system of lines). 

Example 6.8. The functions j| • — a|| 2 are shifts of the polynomial q(x) = 
S"=i x i- Assume we are given an arbitrary polynomial p. Under what exact 
conditions do we have that 

span{ 5 (p( ■ - a)) : g e C(M), a e M n } 

is dense in C(M n )l This next result, as well as variations thereof, can be found 
in Pinkus, Wajnryb [1995]. 

Theorem 6.9. Let p be an arbitrary polynomial in M n . Then for n = 1,2, 3, 

span{ 5 (p( ■ - a)) : g € C(M), a € M n } 
is dense in C(M n ) (uniform convergence on compacta) if and only if 

span{p( • - a) : a e lR n } 

separates points. 

By "separates points" we mean that for any given x, y € M n , x^y, there 
exists a a <E M n for which 

p(x-a) ^p(y-a). 

This condition is obviously necessary. The sufficiency is far from trivial. For 
n = 4 it is also sufficient if p is a homogeneous polynomial. However for n > 4 
this condition is not always sufficient. 

Example 6.9. Miintz's Theorem. The Muntz problem in the multivariate set- 
ting is significantly more difficult than in the univariate setting. Some sufficient 
conditions have been given, but the problem still remains very much open. The 
interested reader is urged to look at Bloom [1992] and Kroo [1994] and references 
therein. 
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